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Abstract 

This paper gives answers to questions formulated as open in the paper "On 
State Observability in Deterministic Finite Automata" by A. Mateescu and Gh. 
PSun. Specifically, it demonstrates that for all k > 2, the families of regular lan- 
guages acceptable by deterministic finite automata with no more than k semi- 
observable states, denoted by are anti-AFL's, and that the family S\ differs 
in the closure property under Kleene +. 

1 Introduction 

In 1987, Mateescu and Paun JJ1 studied state observability in completely specified de- 
terministic finite automata without non-accessible states. They classified the states of 
finite automata into three types: observable, semi-observable, and non-observable. A 
state q is said to be observable if there exists a string w such that 8 (q, w) is a final state; 
otherwise, q is said to be non-observable. Moreover, an observable state q is said to be 
semi-observable if there is a state 8(q,a) that is non-observable, for some input symbol 
a. (In what follows, we assume that each deterministic finite automaton is completely 
specified without non-accessible states and, therefore, call them simply deterministic 
finite automata.) They proved that the family of all regular languages accepted by com- 
pletely specified deterministic finite automata with all states being observable, denoted 
by 0, forms a proper subfamily of the family of regular languages which is not closed 
under union, intersection, complementation, concatenation, intersection with regular 
sets, A -free homomorphism, inverse homomorphism, mirror image, and right and left 
quotient. However, if all the regular languages are taken over a given alphabet E, where 
E is minimal for them, then the family of all such regular languages, denoted by 
is closed under union, concatenation, and left quotient (the other properties are the 
same as for ff). It is also not hard to see that if the automaton has a non-observable 
state, then there is an equivalent automaton which has only one non-observable state 
(by the minimization). Thus, we have the well-known result showing that any reg- 
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ular language is accepted by a deterministic finite automaton with no more than one 
non-observable state. 

On the other hand, considering the number of semi-observable states gives rise 
to an infinite hierarchy of families of subregular languages, denoted by ,%, k > 0. 
In addition, it is known that for k > 4, all the families ,% are anti-AFL's, i.e., they 
are not closed under union, concatenation, Kleene +, X -free homomorphism, inverse 
homomorphism, and intersection with regular sets. However, the properties of (some 
of) the other language families were left open. Note also that in comparison with S^, 
k > 4, ,% is closed under Kleene + and, therefore, is not an anti-AFL. 

This paper answers these questions and proves that except for 3~\ , all the families 
2T k ,k>% are anti-AFL's. 

2 Preliminaries and Definitions 

In this paper, we assume that the reader is familiar with formal language theory (see 
[2 1). For an alphabet (finite nonempty set) E, E* represents the free monoid generated 
by E, where the unit of E* is denoted by X. Set E+ = E* - {X}. Let REG denote 
the family of all regular languages. For other non-specified notions and notations, the 
reader is referred to 0. 

Let s/ — (Q, E, 8,qo,F) be a completely specified deterministic finite automaton, 
i.e., Q is a finite set of states, E is an input alphabet, 8 : Qxl,^ Qis a total (completely 
specified) transition function, qo G Q is the initial state, and F C Q is a (possibly empty) 
set of final states. Let L(s/) denote the language accepted by si ', i.e., L(s/) — {w G 
E* : 5 (go, w) G F}, where 8 is, as usual, extended to be from Q x E* to Q. 

In addition, in this paper we assume that all the states of Q are accessible, which 
means that for each q G Q, there exists w G E* such that 8 (qo, w) — q. In what follows, 
the notion of a deterministic finite automaton (DFA) stands for an automaton which is 
completely specified, deterministic, and without non-accessible states. 

A state q G Q is said to be observable if there exists a string w G E* such that 
8(q,w) G F . Otherwise, q is non-observable. An observable state q G Q is said to be 
semi-observable if for some a G E, the state 8(q,a) is non-observable. A DFA stf is 
said to be observable if all its states are observable, and the language L(g/) is said to 
be observable, too. Let G denote the family of all observable regular languages. For a 
given DFA &/, denote by so(A) the number of semi-observable states in si ' . For each 
k > 0, define the language family ,% = {L G REG : there is a DFA s/ such that L = 
L(s/) and so(A) < k}. 

An alphabet E is minimal for a given language L if L C E* and for each E' C E 
we have L — E'* ^ 0. Let and REG(E) denote the families of observable regular 
languages and regular languages, respectively, for which E is the minimal alphabet. 
Finally, define the language families in an analogous way. 
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3 An Overview of Known Results 



This section presents an overview of the main results proved in [1|. Let L C E* be 
a regular language, and let Init{L) — {w G E* : wy G L for some y G E*}. Then, the 
following characterization of observable languages is known. 

Theorem 1. A regular language L C E*, where E is minimal for L, is observable if and 
onlyifInit(L)=Z*. 

It immediately follows from this theorem that any observable language is infinite. 
In addition, it is known that if L C E* is finite, then E* — L is observable. However, the 
other inclusion does not hold. The following properties are known. 

Theorem 2. The family & is closed under Kleene +. 

Theorem 3. The family ff is not closed under union, intersection, complementation, 
concatenation, intersection with regular sets, X-free homomorphism, inverse homo- 
morphism, mirror image, and right and left quotient. 

Theorem 4. Let E be an alphabet with at least two symbols. Then, the family ^?(E) 
is not closed under intersection, complementation, homomorphism, inverse homomor- 
phism, mirror image, and right quotient. On the other hand, it is closed under union, 
concatenation, and left quotient. 

Theorem 5. 

• .% = 0U{<d}, 

• &k+h k>0, and 

• U>o^ = REG. 

Let L be a regular language. To count the smallest k such that L G we can use 
the following lemma. 

Lemma 6. Given a language L G ^ — 3?k-h the value ofk can be obtained algorith- 
mically, by constructing a minimal DFAfor L. 

The following properties are also known: 

• £?k is not closed under union, for k>2. 

• 5^ is not closed under concatenation, for k>2. 

• ^ is not closed under A-free homomorphism, for k>2. 

• ^ is not closed under intersection with regular sets, for k > 0. 

• is not closed under inverse homomorphism, for k>2. 

• ^ is not closed under Kleene +, for k>4. 

Except for Kleene +, where it is not solved for the families k = 1,2,3, it remains 
to solve these questions for S7{ . This is done in the next section. 
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4 Results 



This section answers the questions formulated as open in HI. It shows that except for 
the family 37[ , all the families ,% are anti-AFL's, for all k>2. 

Lemma 7. Families Sf\ and 2?\(£) are not closed under union, concatenation, X-free 
homomorphism, and inverse homomorphism. 



Union: Let L\,L 2 C {a 7 b}*, L\ = (a + b)a* and L 2 = (a+b)b*. Then, L 1; L 2 G 
£Fi because they are accepted by DFAs = ({s,r,f},{a,b},8i,s,{f}) and M 2 = 
({s,r,f}, {a,b},S 2 ,s,{f}), respectively, where 5i(s,a) = 8 { (s,b) = f, 5i(/» = /, 
Si(f,b) = 8i(r,a) = 8\{r,b) = r, and 82(s,a) = 8 2 (s,b) = f, 8 2 (f,b) = f, Sz(f } a) = 
8 2 (r,a) = 8 2 (r,b) — r. However, L\ UL 2 ^ 3T\ because the minimal DFA accepting it 
is ({1,2,3, 4,5}, {a,b}, 5,1, {2,3, 4}), where 5(1, a) = 5(1, b) =2, 5(2, a) = 3, 
5 (2, A) =4, 5(3, a) =3, 5(3, b) =5, 5(4,a) =5, 8(4,b) =4, and 5(5, a) = S(5,b) =5. 
Clearly, states 3 and 4 are semi-observable, which implies that L\ U L2 € =^2 — ^1 ■ 

Concatenation: Let L\ and L2 be two languages over {a,b} defined by the following 
two deterministic finite automata s$\ and s£ 2 , respectively (see Fig. [T]). As it is easy 
to complete the automaton, the non-observable states are omitted from now on. Then, 



L\,L 2 € S\, with {a,b} being their minimal alphabet, because only states 4 and 3' 
are semi-observable. As the minimal DFA accepting the concatenation of these two 
languages L\ and L 2 is as in Fig. [2] which has two semi-observable states, namely 4 



Figure 2: The minimal DFA for L\ -L 2 with two semi-observable states, 4 and 7. 
and 7, the language L\ ■ L 2 ^ 



Proof. 




Figure 1: DFAs A 1 nndA 2 . 




4 




Figure 3: DFAs for L and h(L), respectively. 



Note that considering two languages over different alphabets, this result can be 
proved more easily. Specifically, let L\ — a + and L2 = b + . Then, L\,L,2 <E but 
U- Lit ft.. 

X-free homomorphism: Let L C {a,b}* be a language defined by the following de- 
terministic finite automaton srf (see Fig. |3), and let h : {a,b}* — ► {a,b}* be a homo- 
morphism defined as h(a) = ab, h(b) = b. Then, L 6 2F\, but h(L) £ 2F\ because the 
minimal DFA accepting h(L) has two semi-observable states (see Fig. |3). 

Inverse homomorphism: Let L C {a,b}* be a language defined by the following deter- 
ministic finite automaton srf (see Fig. 3J, and let /; : {a,b}* — > {«,£>}* be a homomor- 
phism defined as = aba and h(b) = bab. Then, L € but h~ l (L) ^ See Fig. 
[4]for minimal DFAs accepting L and h~ l (L). □ 




Figure 4: DFAs for L and /1 1 (L), respectively. 



Lemma 8. Let E be an alphabet. The families £?2, ^(E), and S^(L) are not 
closed under Kleene +. 

Proof. Let L C {«,£>}* be a language defined by the following deterministic finite au- 
tomaton si (see Fig. g). Then, Le J 2 , but L+ £ S 2 . The proof for ^3 and ^(E) is 



a 




Figure 5: DFAs for L and L + , respectively. 



analogous. □ 
Lemma 9. Families £T\ and (E) are closed under Kleene +. 

Proof. Let L G ^(E), and let = (<2,E, 8,qo,F) be a minimal deterministic finite 
automaton such that L(^#) = L. Moreover, let g € 2 be the only semi-observable state 
of Apply the following algorithm constructing a deterministic finite automaton jV 
such that L{JV) =L + : 
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1. Add a A -transition from each final state to the initial state. 

2. Use the common algorithm for removing A -transitions. 

3. Use the subset construction to construct a deterministic automaton . 

4. Minimize the automaton JV' and remove all non-accessible states. 

Clearly, JY' = (2*2, L, 8', {q }, {X e 2 Q : X HF ^ 0}). Let jV denote the final au- 
tomaton, i.e., JV' where all non-accessible states (including the adjacent transitions) 
are removed. 

Let a € £ be such that k = 8{q,a) is non-observable in Then, if {q} is a state 
of JV , then also {k} = 8'({q},a) is a state of jV . In addition, because ^# is minimal, 
8(k,a) = k for all agI, and we have that {k} is a non-observable state. Thus, {q} is 
a semi-observable state of JY . 

Assume that ^ X C Q is a state of J/ different from {q} (clearly, is non- 
accessible). Let p€X such that pj^q. As pis observable which is not semi-observable 
in Ji, there exists w a G £*, for each a el, such that 8{p,aw a ) <= f ■ Thus, 5'(X,aw a ) (~l 
F^0 and, therefore, X is an observable state which is not semi-observable. □ 

The arguments of the previous proof also explain why the families k > 2, are 
not closed under Kleene +. Let p and q be semi-observable states of a deterministic 
finite automaton accepting L. Then, the deterministic finite automaton accepting L + 
constructed as in the previous proof can have more semi-observable states than the 
original one, namely {/?}, {q}, and {/?,<?}. 
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